Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
4653 | 23 | 10,000 |
5286 | 20 | 1,000 |
5292 | 20 | 50,000 |
6901 | 15 | 20,000 |
7350 | 14 | 3,000 |
7855 | 13 | 5,000 |
8454 | 12 | 1,00,000 |
8468 | 12 | 25,000 |
9172 | 11 | 2,000 |
9179 | 11 | 3,500 |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
3804 | 29 | 10% |
4658 | 23 | 50% |
5079 | 21 | 15% |
5084 | 21 | 20% |
5544 | 19 | 80% |
5815 | 18 | 5% |
6902 | 15 | 25% |
6903 | 15 | 40% |
7331 | 14 | 1% |
7352 | 14 | 59.5% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
69331 | 1 | red&blue |
72476 | 1 | అధినేయ&మ్ |
86815 | 1 | ఎల్&టి |
111275 | 1 | జమ్ము&కాశ్మీరు |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
68307 | 1 | US$48 |
68308 | 1 | US$498.5 |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
880 | 117 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
5804 | 18 | .' |
50471 | 2 | పాట'శాల |
62048 | 1 | 14'రాజు |
63402 | 1 | 21°14'ఉ |
63403 | 1 | 21°31'30 |
63479 | 1 | 22°29'26 |
63542 | 1 | 23°09'34 |
64229 | 1 | 34'00 |
64373 | 1 | 38'తూ |
64731 | 1 | 47'E |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
65005 | 1 | 55+డిగ్రీలు |
66361 | 1 | An+B |
66424 | 1 | B1913+16ను |
67264 | 1 | J05551028+0724255 |
68294 | 1 | U+0C14 |
68330 | 1 | V+dV;కింది |
68480 | 1 | a+b+c |
99808 | 1 | క్లిండామైసిన్+అయిసోట్రిటినోయిన్ |
112609 | 1 | జీ+5 |
120291 | 1 | తెలుగు+హాలీవుడ్ |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
8474 | 12 | https://www |
15643 | 6 | 1/2 |
15703 | 6 | 33/11 |
18294 | 5 | 3/5 |
18360 | 5 | http://censusindia |
20516 | 5 | బావులు/గొట్టపు |
22076 | 4 | https://web |
23480 | 4 | గ్రాములు/సెం |
34922 | 3 | రహదారి/ |
35688 | 3 | విజయవాడ/పెనమలూరు |

In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have a very low frequency, there might be a problem in preprocessing.
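
The check described above can be sketched in Python as follows. This is only an illustration: the function name `flag_suspicious` and the cut-off `MAX_EXPECTED_FREQ` are assumptions, not part of the original tooling, and the set of characters regarded as forbidden has to be chosen per language. The sample rows are taken from the tables in this section.

```python
# Special characters from the list above. Which of them count as
# "forbidden" inside a word depends on the language (assumption: all of them).
SPECIAL_CHARS = ',()%&$"\'+*=/_'

# Hypothetical cut-off for what still counts as "very low frequency".
MAX_EXPECTED_FREQ = 5


def flag_suspicious(rows, forbidden=SPECIAL_CHARS, max_freq=MAX_EXPECTED_FREQ):
    """Return (rank, freq, word, chars) for every word that contains at least
    one forbidden character and occurs more often than max_freq."""
    flagged = []
    for rank, freq, word in rows:
        # Collect the distinct forbidden characters found in this word.
        hits = sorted({ch for ch in word if ch in forbidden})
        if hits and freq > max_freq:
            flagged.append((rank, freq, word, "".join(hits)))
    return flagged


# (rank, frequency, word) rows copied from the tables in this section.
sample = [
    (4653, 23, "10,000"),
    (3804, 29, "10%"),
    (69331, 1, "red&blue"),
    (880, 117, '."'),
]

for row in flag_suspicious(sample):
    print(row)
```

With these settings the number `10,000` is flagged as well, which shows why the forbidden set needs tuning: a comma inside a number is usually legitimate, whereas a frequent token such as `."` more likely points to a tokenization issue.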
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
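
The same query can be repeated for every character in the list. The following sketch does this in Python against an in-memory SQLite database, which is an assumption made only to keep the example self-contained (the query above looks like MySQL syntax, and SQLite needs an explicit `ESCAPE` clause). The table layout `words(w_id, word, freq)` is inferred from the query, the helper names `like_pattern` and `words_containing` are hypothetical, and the inserted rows reuse entries from the tables above with `w_id` set to rank + 100.

```python
import sqlite3

SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(char):
    """Build a LIKE pattern that matches words containing `char` literally.
    % and _ are LIKE wildcards, so they (and the backslash) are escaped."""
    escaped = "\\" + char if char in "%_\\" else char
    return "%" + escaped + "%"


def words_containing(conn, char, offset=100, limit=10):
    """Return (rank, freq, word) rows for words containing `char`,
    skipping the first `offset` word ids as the original query does."""
    sql = (
        "SELECT w_id - ?, freq, word FROM words "
        "WHERE w_id > ? AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (offset, offset, like_pattern(char), limit)).fetchall()


# Small in-memory stand-in for the real word table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (3904, "10%", 29),
        (4753, "10,000", 23),
        (4758, "50%", 23),
        (68407, "US$48", 1),
        (68580, "a+b+c", 1),
    ],
)

for ch in SPECIAL_CHARS:
    rows = words_containing(conn, ch)
    if rows:
        print(repr(ch), rows)
```

Escaping matters most for `%` and `_`: an unescaped `_` matches any single character, so the query for underscores would otherwise return almost every word.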
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots